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Abstract 

The following network computing problem is considered. Source nodes in a directed acyclic network generate 
independent messages and a single receiver node computes a target function / of the messages. The objective is to 
maximize the average number of times / can be computed per network usage, i.e., the "computing capacity". The 
network coding problem for a single-receiver network is a special case of the network computing problem in which 
all of the source messages must be reproduced at the receiver. For network coding with a single receiver, routing is 
known to achieve the capacity by achieving the network min-cut upper bound. We extend the definition of min-cut 
to the network computing problem and show that the min-cut is still an upper bound on the maximum achievable 
rate and is tight for computing (using coding) any target function in multi-edge tree networks and for computing 
linear target functions in any network. We also study the bound's tightness for different classes of target functions. 
In particular, we give a lower bound on the computing capacity in terms of the Steiner tree packing number and a 
differnet bound for symmetric functions. We also show that for certain networks and target functions, the computing 
capacity can be less than an arbitrarily small fraction of the min-cut bound. 



"This work was supported by the National Science Foundation and the UCSD Center for Wireless Communications. 

The authors are with the Department of Electrical and Computer Engineering, University of California, San Diego, La Jolla, CA 92093-0407. 
(rathnam@ucsd.edu, massimo@ece.ucsd.edu, nikhil@ucsd.edu, zeger@ucsd.edu) 



Appuswamy-Franceschetti-Karamchandani-Zeger 



August 9, 2010 



1 Introduction 

We consider networks where source nodes generate independent messages and a single receiver node computes a target 
function / of these messages. The objective is to characterize the maximum rate of computation, that is the maximum 
number of times / can be computed per network usage. 
Giridhar and Kumar [18] have recently stated: 

"In its most general form, computing a function in a network involves communicating possibly correlated 
messages, to a specific destination, at a desired fidelity with respect to a joint distortion criterion dependent 
on the given function of interest. This combines the complexity of source coding of correlated sources, 
with rate distortion, different possible network collaborative strategies for computing and communication, 
and the inapplicability of the separation theorem demarcating source and channel coding." 

The overwhelming complexity of network computing suggests that simplifications be examined in order to obtain 
some understanding of the field. 

We present a natural model of network computing that is closely related to the network coding model of Ahlswede, 
Cai, Li, and Yeung [1,49]. Network coding is a widely studied communication mechanism in the context of network 
information theory. In network coding, some nodes in the network are labeled as sources and some as receivers. 
Each receiver needs to reproduce a subset of the messages generated by the source nodes, and all nodes can act as 
relays and encode the information they receive on in-edges, together with the information they generate if they are 
sources, into codewords which are sent on their out-edges. In existing computer networks, the encoding operations 
are purely routing: at each node, the codeword sent over an out-edge consists of a symbol either received by the node, 
or generated by it if is a source. It is known that allowing more complex encoding than routing can in general be 
advantageous in terms of communication rate [1, 22, 38]. Network coding with a single receiver is equivalent to a 
special case of our function computing problem, namely when the function to be computed is the identity, that is when 
the receiver wants to reproduce all the messages generated by the sources. In this paper, we study network computation 
for target functions different than the identity. 

Some other approaches to network computation have also appeared in the literature. In [8, 11, 12, 28, 34, 39] 
network computing was considered as an extension of distributed source coding, allowing the sources to have a joint 
distribution and requiring that a function be computed with small error probability. A rate-distortion approach to the 
problem has been studied in [10, 15,47]. However, the complexity of network computing has restricted prior work to 
the analysis of elementary networks. Networks with noisy links were studied in [3, 14, 16, 17, 19, 26, 35, 37, 50] and 
distributed computation in networks using gossip algorithms was studied in [4-6, 9, 27, 36]. 

In the present paper, our approach is somewhat (tangentially) related to the field of communication complexity 
[30, 48] which studies the minimum number of messages that two nodes need to exchange in order to compute a 
function of their inputs with zero error. Other studies of computing in networks have been considered in [18,43], but 
these were restricted to the wireless communication protocol model of Gupta and Kumar [20]. 

In contrast, our approach is more closely associated with wired networks with independent noiseless links. Our 
work is closest in spirit to the recent work of [31,40^42] on computing the sum (over a finite field) of source messages 
in networks. We note that in independent work, Kowshik and Kumar [29] obtain the asymptotic maximum rate of 
computation in tree networks and present bounds for computation in networks where all nodes are sources. 

Our main contributions are summarized in Section 1.3, after formally introducing the network model. 

1.1 Network model and definitions 

In this paper, a network AT consists of a finite, directed acyclic multigraph G = (V, £), a set of source nodes S — 
{a\, . . . , ag} C V, and a receiver p € V. Such a network is denoted by J\f = (G, S, p). We will assume that p ^ S 
and that the graph 1 G contains a directed path from every node in V to the receiver p. For each node u E V, let £i(u) 
and £ (u) denote the set of in-edges and out-edges of u respectively. We will also assume (without loss of generality) 
that if a network node has no in-edges, then it is a source node. 

1 Throughout the paper, we will use "graph" to mean a directed acyclic multigraph, and "network" to mean a single-receiver network. We may 
sometimes write S (G) to denote the edges of graph G. 



Page 1 of 26 



Appuswamy-Franceschetti-Karamchandani-Zeger 



August 9, 2010 



An alphabet A is a finite set of size at least two. For any positive integer m, any vector x G A rn , and any 

i G {1,2,..., m}, let xt denote the i-th component of x. For any index set I = {ii, i%, . . . , i q } C {1,2,..., m} with 

ii < i2 < ■ ■ ■ < i q , let Xi denote the vector (xi 1 ,Xi 2 , . . . ,Xi ) G A^ 1 ' . 

The network computing problem consists of a network Af and a target function f of the form 



f-A s 



B 



(see Definition 1.4 for some examples). We will also assume that any target function depends on all network sources 
(i.e. they cannot be constant functions of any one of their arguments). Let k and n be positive integers. Given a 
network Af with source set S and alphabet A, a message generator is any mapping 



5' 



A k 



For each source Ci, a(ai) is called a message vector and its components ct(cr^) 1 , . . . , are called messages. 2 

Definition 1.1. A (k,n) network code for computing a target function / in a network Af consists of the following: 
(i) For any node v G V — p and any out-edge e G S (v), an encoding function: 




A n if v is a source node 
otherwise 



(ii) A decoding function: 

\£i(p)\ 

ip : 1J . 4 b" . 

3=1 

Given a (£;, n) network code, every edge e G £ carries a vector z e of at most n alphabet symbols, 3 which is 
obtained by evaluating the encoding function hs e ^ on the set of vectors carried by the in-edges to the node and the 
node's message vector if it is a source. The objective of the receiver is to compute the target function / of the 
source messages, for any arbitrary message generator a. More precisely, the receiver constructs a vector of k alphabet 
symbols such that for each i G {1,2,..., k}, the i-th component of the receiver's computed vector equals the value of 
the desired target function / applied to the i-th components of the source message vectors, for any choice of message 
generator a. Let ei, e^, . . . , e^.^ p ^ denote the in-edges of the receiver. 

Definition 1.2. A (fc, n) network code is called a solution for computing f in Af (or simply a (k, n) solution) if the 
decoding function tp is such that for each j G {1,2, ... ,k} and for every message generator a, we have 




If there exists a (fc, n) solution, we say the rational number k/n is an achievable computing rate. 

In the network coding literature, one definition of the coding capacity of a network is the supremum of all achiev- 
able coding rates [7, 13]. We adopt an analogous definition for computing capacity. 

Definition 1.3. The computing capacity of a network Af with respect to target function / is 

C co d(A/', /) = supj— : 3 (k, n) network code for computing / in AfX. 



2 For simplicity, we assume that each source has exactly one message vector associated with it, but all of the results in this paper can readily be 
extended to the more general case. 

3 By default, we will assume that edges carry exactly n symbols. 
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Thus, the computing capacity is the supremum of all achievable computing rates for a given network Af and a 
target function /. Some example target functions are defined below. 

Definition 1.4. 



Target function / 


Alphabet A 


f(xi,.. 


• , x s 




Comments 


identity 


arbitrary 


(xi,... 


,x s ) 






arithmetic sum 


{0,1, .-.,9-1} 


X\ + x 2 + 


— h 


X s 


'+' is ordinary integer addition 


mod r sum 


{0,1, .-.,9-1} 


Xl®X 2 ® 


. . . © 


X s 


© is mod r addition 


histogram 


{0,1, .-.,9-1} 


(c ,ci, . . 




l) 


c i = \{j '■ x j = i}\ for each i G A 


linear 


any finite field 


a\X\ + a 2 x 2 - 




r Oj S X s 


arithmetic performed in the field 


maximum 


any ordered set 


max {a;i, 


..,x 


s} 





Definition 1.5. For any target function / : A s — > B, any index set I C {1,2, ... , s}, and any a,6 6 A^K we write 
a = b if for every x, y S A s , we have f(x) = f(y) whenever xj = a, yj = b, and Xj = yj for all j g 1 I. 

It can be verified that = is an equivalence relation 4 for every / and /. 

Definition 1.6. For every / and /, let Rtj denote the total number of equivalence classes induced by = and let 

$rj:AW ^{1,2,...,R U } 
be any function such that <f>ir(a) — $ij(b) iff a = b. 

That is, <&i t f assigns a unique index to each equivalence class, and 

Rij = |{*i,/(o) • 
The value of Rjj is independent of the choice of $/,/. We call Rjj the footprint size of / with respect to /. 

g(x) V 



X Y 



Figure 1 : X, Y are two sources with messages x and y respectively. X communicates g(x) to Y so that Y can compute 
a function / of x and y. 

Remark 1.7. Let I c = {1, 2, . . . , s} — /. The footprint size Rj j has the following interpretation (see Figure 1). 
Suppose a network has two nodes, X and Y, and both are sources. A single directed edge connects X to Y. Let X 
generate x S *4' 7 ' and Y generate y G A IC ' . X communicates a function g(x) of its input, to Y so that Y can compute 
f(a) where a £ A s , aj = x, and a/ c = y. Then for any x,x £ A^ 1 ^ such that x ^ x, we need g(x) ^ g(x). Thus 
\g | ^ Ri,f> which implies a lower bound on a certain amount of "information" that X needs to send to Y to 

ensure that it can compute the function /. Note that g = achieves the lower bound. We will use this intuition to 
establish a cut-based upper bound on the computing capacity C co d(A/", /) of any network Af with respect to any target 
function /, and to devise a capacity-achieving scheme for computing any target function in multi-edge tree networks. 



4 Witsenhausen [46] represented this equivalence relation in terms of the independent sets of a characteristic graph and his representation has 
been used in various problems related to function computation [11, 12,39]. Although = is defined with respect to a particular index set / and a 
function /, we do not make this dependence explicit - the values of / and / will be clear from the context. 
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Definition 1.8. A set of edges C C £ in network Af is said to separate sources a mi , ■ ■ ■ , <Jm d from the receiver p, if 
for each i G {1,2,..., d}, every directed path from a mi to p contains at least one edge in C. The set C is said to be a 
cut in A/" if it separates at least one source from the receiver. For any network Af, define A(Af) to be the collection of 
all cuts in Af. For any cut C G A (AT) and any target function /, define 

Ic = {i ■ C separates a; from the receiver} 
Rc.j = Ri c J- (2) 

Since target functions depend on all sources, we have Rcj > 2 for any cut C and any target function /. The 
footprint sizes Rc.f for some example target functions are computed below. 

A multi-edge tree is a graph such that for every node v G V, there exists a node u such that all the out-edges of v 
are in-edges to u, i.e., £ (v) C (e.g. see Figure 2). 




Figure 2: An example of a multi-edge tree. 



1.2 Classes of target functions 

We study the following four classes of target functions: (1) divisible, (2) symmetric, (3) A-exponential, (4) A-bounded. 

Definition 1.9. A target function / : A s — > B is divisible if for every index set / C {1, . . . , s}, there exists a finite 
set Bi and a function f 1 : .4J 7 — 5- Bi such that the following hold: 

(1) / {1 s} = / 

(2) |/ 7 (.Al J l)| < |/(^)| 

(3) For every partition {Ii, . . . , I y } of /, there exists a function 

(? : B^ x • • • x Bi — > Bi such that for every x G A' 1 ', we have 
/ J (^)=3(/ 7l (^ 1 ),...,/^K)). 

Examples of divisible target functions include the identity, maximum, mod r sum, and arithmetic sum. 

Divisible functions have been studied previously 5 by Giridhar and Kumar [18] and Subramanian, Gupta, and 
Shakkottai [43]. Divisible target functions can be computed in networks in a divide-and-conquer fashion as follows. 
For any arbitrary partition {Ji, . . . , 7 7 } of the source indices {1, . . . , s}, the receiver p can evaluate the target function 
/ by combining evaluations of f 11 , . . . , f 1 " 1 . Furthermore, for every i = 1, . . . , 7, the target function / /; can be 
evaluated similarly by partitioning Ij and this process can be repeated until the function value is obtained. 

Definition 1.10. A target function / : A s — > B is symmetric if for any permutation tt of {1, 2, . . . , s} and any vector 

x G A s , 

f(xi,x 2 , ...,x s ) = /(a; 7r (i),x 7r ( 2 ), . . .,x^ s )). 

5 The definitions in [18,43] are similar to ours but slightly more restrictive. 
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That is, the value of a symmetric target function is invariant with respect to the order of its arguments and hence, it 
suffices to evaluate the histogram target function for computing any symmetric target function. Examples of symmetric 
functions include the arithmetic sum, maximum, and mod r sum. Symmetric functions have been studied in the 
context of computing in networks by Giridhar and Kumar [18], Subramanian, Gupta, and Shakkottai [43], Ying, 
Srikant, and Dullerud [50], and [26]. 

Definition 1.11. Let AG (0, 1]. A target function / : A s — > B is said to be X-exponential if its footprint size satisfies 

Rij > \A\ m for every / C {l,2,...,s}. 
Let A £ (0, oo). A target function / : ,4 s — > B is said to be X-bounded if its footprint size satisfies 

Rij < \A\ X for every I C {1,2,..., s}. 

Example 1.12. The following facts are easy to verify: 

• The identity function is 1-exponential. 

• Let A be an ordered set. The maximum (or minimum) function is 1-bounded. 

• Let A = {0, 1, . . . , q — 1} where q > 2. The mod r sum target function with q > r > 2 is log g r-bounded. 

Remark 1.13. Giridhar and Kumar [18] defined two classes of functions: type-threshold and type-sensitive functions. 
Both are sub-classes of symmetric functions. In addition, type-threshold functions are also divisible and c-bounded, for 
some constant c that is independent of the network size. However, [18] uses a model of interference for simultaneous 
transmissions and their results do not directly compare with ours. 

Following the notation in Leighton and Rao [33], the min-cut of any network Af with unit-capacity edges is 

I CI 

min-cut(A0 = min - — -. (3) 

ceA(A0 \Ic\ 

A more general version of the network min-cut plays a fundamental role in the field of multi-commodity flow [33,44]. 
The min-cut provides an upper bound on the maximum flow for any multi-commodity flow problem. The min-cut is 
also referred to as "sparsity" by some authors, such as Harvey, Kleinberg, and Lehman [22] and Vazirani [44]. We 
next generalize the definition in (3) to the network computing problem. 

Definition 1.14. If M is a network and / is a target function, then define 

I CI 

min-cut (Af, f) — min . (4) 

ceA(A0 log|_4| R C j 

Example 1.15. 

• If / is the identity target function, then 

|C| 

min-cut (Af, f) = min 7 — 7 . 

V J ' CGA(A0 I Jo I 

Thus for the identity function, the definition of min-cut in (3) and (4) coincide. 

• Let A= {0,1, . . . ,q — 1}. If / is the arithmetic sum target function, then 

|C| 

min-cut (A/", /) = min . . — . (5) 

V ,J> CGA(A0 log g ((g-l)|Jo| + l) 

• Let A be an ordered set. If / is the maximum target function, then 

min-cut(A/\ f) = min ICI . 

CGA(Af) 
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1.3 Contributions 

The main results of this paper are as follows. In Section 2, we show (Theorem 2.1) that for any network Af and any 
target function /, the quantity min-cut(A/", /) is an upper bound on the computing capacity C C0C j(AA, /). In Section 3, 
we note that the computing capacity for any network with respect to the identity target function is equal to the min-cut 
upper bound (Theorem 3.1). We show that the min-cut bound on computing capacity can also be achieved for all 
networks with linear target functions over finite fields (Theorem 3.2) and for all multi-edge tree networks with any 
target function (Theorem 3.3). For any network and any target function, a lower bound on the computing capacity is 
given in terms of the Steiner tree packing number (Theorem 3.5). Another lower bound is given for networks with 
symmetric target functions (Theorem 3.7). In Section 4, the tightness of the above-mentioned bounds is analyzed 
for divisible (Theorem 4.2), symmetric (Theorem 4.3), A-exponential (Theorem 4.4), and A-bounded (Theorem 4.5) 
target functions. For A-exponential target functions, the computing capacity is at least A times the min-cut. If every 
non-receiver node in a network is a source, then for A-bounded target functions the computing capacity is at least a 
constant times the min-cut divided by A. It is also shown, with an example target function, that there are networks 
for which the computing capacity is less than an arbitrarily small fraction of the min-cut bound (Theorem 4.7). In 
Section 5, we discuss an example network and target function in detail to illustrate the above bounds. In Section 6, 
conclusions are given and various lemmas are proven in the Appendix. 

2 Min-cut upper bound on computing capacity 

The following shows that the maximum rate of computing a target function / in a network Af is at most min-cut(A^, /) . 
Theorem 2.1. If Af is a network with target function f, then 

C cod (Af,f)<rmn-cut(Af,f). 

Proof. Let the network alphabet be A and consider any (k, n) solution for computing / in Af. Let C be a cut and 
for each i e {1,2,..., k}, let a (i) , &W e A^ Ia K Suppose j G {1,2,..., k} is such that a® ^ 6 (i) , where = is the 
equivalence relation from Definition 1.5. Then there exist x, y E A s satsifying: f(x) ^ f(y), xi c — a > Vic = & > 
and Xi = yi for every i ^ Iq. 

The receiver p can compute the target function / only if, for every such pair {a' 1 ' , . . . , } and {&W , . . . , b^ } 
corresponding to the message vectors generated by the sources in Ic, the edges in cut C carry distinct vectors. Since 
the total number of equivalence classes for the relation = equals the footprint size Rc,f, the edges in cut C should 
carry at least (Rcj) k distinct vectors. Thus, we have 

A n M > {Rcjf 

and hence for any cut C, 

k < id 

" ~ lo S|^| R c,f ' 

Since the cut C is arbitrary, the result follows from Definition 1.3 and (4). ■ 

The min-cut upper bound has the following intuition. Given any cut C £ A(Af), at least log^ Rcj units of 
information need to be sent across the cut to successfully compute a target function /. In subsequent sections, we 
study the tightness of this bound for different classes of functions and networks. 

3 Lower bounds on the computing capacity 

The following result shows that the computing capacity of any network Af with respect to the identity target function 
equals the coding capacity for ordinary network coding. 
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Theorem 3.1. If M is a network with the identity target function f, then 

Ccod{N, f) = min-cut (TV, /) = min-cut (AT)- 

Proof. Rasala Lehman and Lehman [32, p. 6, Theorem 4.2] showed that for any single-receiver network, the conven- 
tional coding capacity (when the receiver demands the messages generated by all the sources) always equals the min- 
cut(TV). Since the target function / is the identity, the computing capacity is the coding capacity and min-cut(7V, /) = 
min-cut (AT), so the result follows. ■ 

Theorem 3.2. If M is a network with a finite field alphabet and with a linear target function f, then 

Ccod{N, f) = min-cut(Af, /). 

Proof. Follows from [41, Theorem 2]. ■ 

Theorems 3.1 and 3.2 demonstrate the achievability of the min-cut bound for arbitrary networks with particular 
target functions. In contrast, the following result demonstrates the achievability of the min-cut bound for arbitrary 
target functions and a particular class of networks. The following theorem concerns multi-edge tree networks, which 
were defined in Section 1.1. 

Theorem 3.3. If M is a multi-edge tree network with target function f, then 

Ccod(Af, I) = min-cut(Af, /). 

Proof. Let A be the network alphabet. From Theorem 2. 1, it suffices to show that C C0l i(Af, /) > min-cut(A/", /). Since 
£ (v) is a cut for node v £ V — p, and using (2), we have 

min-cut(A/\ /) < min ^MJ . ( 6 ) 

v £ V-p l0g|_4| Rs ( v )J 

Consider any positive integers k, n such that 

k . \£ (y)\ 

— < mm . (7) 

n vev-p log lAl R l£oiv)J 

Then we have 

|_4||£ o 0)l« > R h l£ ^ )f for every node v 6 V-p. (8) 

We outline a (fc,?i) solution for computing / in the multi-edge tree network N. Each source a.i 6 S generates a 
message vector a(cfi) 6 A k . Denote the vector of i-th components of the source messages by 

a:W = (a(tri) 4 , • • • ^(cr^). 

Every node v G V — {p} sends out a unique index (as guaranteed by (8)) over A^ Sa ^ n corresponding to the set of 
equivalence classes 

$ W/(42 oW ) for I e{l, •••,*}. (9) 

If v has no in-edges, then by assumption, it is a source node, say ix, . The set of equivalence classes in (9) is a 
function of its own messages a^o^); for I £ {1, . . . , k}. On the other hand if v has in-edges, then let ui, U2, ••• ,Uj 
be the nodes with out-edges to v. For each i £ {1, 2, ■ • ■ , j}, using the uniqueness of the index received from Uj, node 
v recovers the equivalence classes 



(^/^/IL,,) for l &{!,-■■ ,k}. (10) 
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Furthermore, the equivalence classes in (9) can be identified by v from the equivalance classes in (10) (and a(v) if v 
is a source node) using the fact that for a multi-edge tree network Af, we have a disjoint union 

j 

I £o(v) = IJ h {ui)- 

i=l 

If each node v follows the above steps, then the receiver p can identify the equivalence classes &i e . (p) j (x^) 
for i £ {1, . . . , k}. The receiver can evaluate f(x"') for each I from these equivalence classes. The above solution 
achieves a computing rate of k/n. From (7), it follows that 

Ccod(AA,/)> min ■ ( n > 

ve v-p log lA{ Ri £oM j 



We next establish a general lower bound on the computing capacity for arbitrary target functions (Theorem 3.5) 
and then another lower bound specifically for symmetric target functions (Theorem 3.7). 

For any network Af = (G, S, p) with G = (V, £), define a Steiner tree 6 of Af to be a minimal (with respect to 
nodes and edges) subgraph of G containing S and p such that every source in S has a directed path to the receiver p. 
Note that every non-receiver node in a Steiner tree has exactly one out-edge. Let T{Af) denote the collection of all 
Steiner trees in Af. For each edge e £ £(G), let J e = {i : ti £ T(Af) and e £ £(tj)}. The fractional Steiner tree 
packing number n(AT) is defined as the linear program 

f u t > V ^ £ T(W) , 
n(A0=max Ul sub J ectto \ Vu*<l Ve££(G). (12) 

Note that H(Af) > 1 for any network Af, and the maximum value of the sum in (12) is attained at one or more vertices 
of the closed polytope corresponding to the linear constraints. Since all coefficients in the constraints are rational, the 
maximum value in (12) can be attained with rational m's. The following theorem provides a lower bound 7 on the 
computing capacity for any network Af with respect to a target function / and uses the quantity H(Af). In the context 
of computing functions, Ui in the above linear program indicates the fraction of the time the edges in tree ti are used 
to compute the desired function. The fact that every edge in the network has unit capacity implies j u i — 1- 

Lemma 3.4. For any Steiner tree G' of a network Af, let Af 1 = (G', 5*, p). Let C' be a cut in Af'. Then there exists a 
cut C in Af such that Ic = Ic- 

(Note that Ic is the set indices of sources separated in Af' by C'. The set Ic may differ from the indices of 
sources separated in Af by C' .) 

Proof. Define the cut 

C= |J £o{ct ¥ ). (13) 

C is the collection of out-edges in Af of a set of sources disconnected by the cut C' in Af'. If i £ Ic, then, by (13), C 
disconnects er; from p in Af, and thus Ic C Ic- 

Let o~i be a source, such that i £ Ic and Let P be a path from at to p in Af. From (13), it follows that there 
exists i' £ Ic such that P contains at least one edge in £ (ai>). If P also lies in Af' and does not contain any edge 



6 Steiner trees are well known in the literature for undirected graphs. For directed graphs a "Steiner tree problem" has been studied and our 
definition is consistent with such work (e.g., see [25]). 

7 In order to compute the lower bound, the fractional Steiner tree packing number Tl(M) can be evaluated using linear programming. Also note 
that if we construct the reverse multicast network by letting each source in the original network Af become a receiver, letting the receiver in the 
become the only source, and reversing the direction of each edge, then it can be verified that the routing capacity for the reverse multicast network 
is equal to fl(A^). 



Page 8 of 26 



Appuswamy-Franceschetti-Karamchandani-Zeger 



August 9, 2010 



in C", then ov has a path to p in TV that does not contain any edge in C", thus contradicting the fact that ov 6 Ic 
Therefore, either P does not lie in TV or P contains an edge in C . Thus cr; € Ic , i.e., Ic Q Ic ■ I 

Theorem 3.5. If Af is a network with alphabet A and target function f, then 

C CO d(Af, f) > n(7V) 



CeA(Af) log ^| R C j 

Proof Suppose TV = (G, S, p). Consider a Steiner tree G' = (V, £') of TV, and let TV = (G", 5", p). From Lemma 
3.4 (taking C" to be £ (v) in TV 7 ), we nave 

V v 6 V' - p, 3 C e A (TV) such that 7^ o(l)) = 7/ c . (14) 

Now we lower bound the computing capacity for the network TV with respect to target function /. 

C cod (Af', /) = min-cut(TV', /) [from Theorem 3.3] (15) 

1 



v e v-p log|_4| Ri' £o(vy f 



[from Theorem 2. 1,(6), (11)] 



> min [from (14)]. (16) 

ceA(A0 log^i Ri c j 

The lower bound in (16) is the same for every Steiner tree of N. We will use this uniform bound to lower bound the 
computing capacity for Af with respect to /. Denote the Steiner trees of AT by t±, . . . , tr- Let e > and let r denote 
the quantity on the right hand side of (16). On every Steiner tree ti, a computing rate of at least r — e is achievable by 
(16). Using standard arguments for time-sharing between the different Steiner trees of the network Af, it follows that 
a computing rate of at least (r — e) ■ n(7V) is achievable in A/", and by letting e — >• 0, the result follows. ■ 

The lower bound in Theorem 3.5 can be readily computed and is sometimes tight. The procedure used in the proof 
of Theorem 3.5 may potentially be improved by maximizing the sum 

r Ui > o v u e r (ao , 

J2 UiU subject to <^ Vw 4 <l Vee£(G) ( 17 > 

where r,; is any achievable rate 8 for computing / in the Steiner tree network A/i = (ti, S, p). 

We now obtain a different lower bound on the computing capacity in the special case when the target function is the 
arithmetic sum. This lower bound is then used to give an alternative lower bound (in Theorem 3.7) on the computing 
capacity for the class of symmetric target functions. The bound obtained in Theorem 3.7 is sometimes better than that 
of Theorem 3.5, and sometimes worse (Example 3.8 illustrates instances of both cases). 

Theorem 3.6. If Af is a network with alphabet A = {0, 1, . . . , q — 1} and the arithmetic sum target function f, then 

C cod (Af,f)> min |C ' 

ceA(A0 log g P qiS 

where P q s denotes the smallest prime number greater than s(q — 1). 

Proof. Let p = P C[:S and let Af' denote the same network as Af but whose alphabet is F p , the finite field of order p. 

Let e > 0. From Theorem 3.2, there exists a (fc, n) solution for computing the F p -sum of the source messages in 
TV' with an achievable computing rate satisfying 

- > min \C\ - e. 

n ceA(A0 



8 From Theorem 3.3, Ti can be arbitrarily close to min-cut(£;, /). 
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This (k, n) solution can be repeated to derive a (ck, an) solution for any integer c > 1 (note that edges in the network 
Af carry symbols from the alphabet A = {0, 1, ...,</ — 1}, while those in the network TV' carry symbols from a 
larger alphabet F p ). Any (ck, an) solution for computing the F p -sum in Af' can be 'simulated' in the network Af by a 
[ck, \cn log, p] ) code (e.g. see [2]). Furthermore, since p> s(q — 1) + 1 and the source alphabet is {0, 1, . . . , q — 1}, 
the Fp-sum of the source messages in network Af is equal to their arithmetic sum. Thus, by choosing c large enough, 
the arithmetic sum target function is computed in Af with an achievable computing rate of at least 

min I CI 

ceA(AQ' _ 2e 

Since e is arbitrary, the result follows. ■ 
Theorem 3.7. If Af is a network with alphabet A = {0, 1, . . . , q — 1} and a symmetric target function f, then 



C C od(Af, f) > 



min I CI 



(9-1). log, P(«) 
where P(s) is the smallest prime number 9 greater than s. 

Proof. From Definition 1.10, it suffices to evaluate the histogram target function / for computing /. For any set of 
source messages (xi,x 2 , ■ ■ ■ ,x s ) e -4 s , we have 

f(xi,...,x s ) = (c ,ci, ...,c 5 _i) 

where Cj = |{j : Xj = i}\ for each i e A. Consider the network Af' = (G, S, p) with alphabet A' = {0, 1}. Then for 
each i E A, c% can be evaluated by computing the arithmetic sum target function in Af' where every source node <jj is 
assigned the message 1 if Xj — i, and otherwise. Since we know that 

9-1 

the histogram target function / can be evaluated by computing the arithmetic sum target function q — 1 times in the 
network Af' with alphabet A' = {0, 1}. Let e > 0. From Theorem 3.6 in the Appendix, there exists a (fc, n) solution 
for computing the arithmetic sum target function in Af' with an achievable computing rate of at least 

min ICI 
k ^ ceAfjV) 

— > e. 

n log 2 P(s) 

The above (fc, n) solution can be repeated to derive a (ck, an) solution for any integer c > 1. Note that edges in the 
network Af carry symbols from the alphabet A= {0, 1, . . . , q — 1}, while those in the network Af' carry symbols from 
A' = {0, 1}. Any (ck, cn) code for computing the arithmetic sum function in Af' can be simulated in the network Af 
by a (ck, [cn log, 2]) code 10 . Thus by choosing c large enough, the above-mentioned code can be simulated in the 

network Af to derive a solution for computing the histogram target function / with an achievable computing rate 11 of 
at least 

min |C| 

1 1 ceA(A0' 2e 



(q-1) log, 2 l0g 2 P( S ) 

Since e is arbitrary, the result follows. 



'From Bertrand's Postulate [21, p. 343], we have P(s) < 2s. 
10 To see details of such a simulation, we refer the interested reader to [2]. 

"Theorem 3.7 provides a uniform lower bound on the achievable computing rate for any symmetric function. Better lower bounds can be found 
by considering specific functions; for example Theorem 3.6 gives a better bound for the arithmetic sum target function. 
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P 

Mi 



M; 



Figure 3: The Reverse Butterfly Network A2 has two binary sources {<7i, 02} and network A3 has three binary sources 
{01, 02,03}, each with A = {0, 1}. Each network's receiver p computes the arithmetic sum of the source messages. 



Example 3.8. Consider networks A/2 and A3 in Figure 3, each with alphabet A = {0, 1} and the (symmetric) 
arithmetic sum target function /. Theorem 3.7 provides a larger lower bound on the computing capacity C co d(A/2, /) 
than Theorem 3.5, but a smaller lower bound on C C od(A/3, /). 

• For network A/9 (in Figure 3), we have max Rn t = 3 and min ICI = 2, both of which occur, for example, 

ceA(M) ceA(aF) 

when C consists of the two in-edges to the receiver p. Also, (q — 1) log„ P(s, q) = log 2 3 and n(A/") = 3/2, so 



Ccod(M,/) > (3/2)/log 2 3 
C C od(AA 2 ,/) >2/log 2 3 



[from Theorem 3.5] 
[from Theorem 3.7]. 



(18) 



In fact, we get the upper bound C co d (A/2 , /) < 2/ log 2 3 from Theorem 2.1, and thus from ( 1 8), C co d (A/2 , /) = 
2/log 2 3. 

For network A3, we have max Rn f = 4 and min ICI = 1, both of which occur when C = Uai, p)}. 

CGA(Af) ,J CSA(A0 

Also, (q - 1) log 9 P(s, q) = log 2 5 and II (AA) = 1, so 

Ccod(AA 3 ,/) > l/log 2 4 
Ccod(AA 3 ,/) > l/log 2 5 

From Theorem 3.3, we have C CO d(A/"3, /) = 1/ log 2 4. 



[from Theorem 3.5] 
[from Theorem 3.7]. 



Remark 3.9. An open question, pointed out in [7], is whether the coding capacity of a network can be irrational. 
Like the coding capacity, the computing capacity is the supremum of ratios k/n for which a (k,n) solution exists. 
Example 3.8 demonstrates that the computing capacity of a network (e.g. A/2) with unit capacity links can be irrational 
when the target function is the arithmetic sum function. 



4 On the tightness of the min-cut upper bound 

In the previous section, Theorems 3.1 - 3.3 demonstrated three special instances for which the min-cut(A/", /) upper 
bound is tight. In this section, we use Theorem 3.5 and Theorem 3.7 to establish further results on the tightness of the 
min-cut(A/", /) upper bound for different classes of target functions. 

The following lemma provides a bound on the footprint size Rj j for any divisible target function /. 
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Lemma 4.1. For any divisible target function f : A s — > B and any index set I C {1, 2, . . . , s}, the footprint size 
satisfies 

Rij<\f(A s )\- 

Proof. From the definition of a divisible target function, for any / C {1,2,..., s}, there exist maps f 1 , f 1 , and g 
such that 

f{x)=g(f I {x I ),f I \x I .)) Vxe/ 

where I c = {1, 2, . . . , s} — I. From the definition of the equivalence relation = (see Definition 1.5), it follows that 
a, b G .A' 7 ' belong to the same equivalence class whenever f I (a) = / / (&). This fact implies that Rij < |/ 7 (-4' 7 ') |- 
We need j/ 7 (*4' z ') | < \ f {A s )\ to complete the proof which follows from Definition 1.9(2). ■ 

Theorem 4.2. If Af is a network with a divisible target function f, then 

C cod {M, f) > ■ min-cut(AA, /) 

\£i(p)\ 

where £%{p) denotes the set of in-edges of the receiver p. 
Proof. Let A be the network alphabet. From Theorem 3.5, 

C C od(AA, /) > n(7V) ■ min 



ceA(Af) log^i R C j 

^ ' 1 TFTHm [fromLemma 4.1]. (19) 

log|^| \ f(A s )\ 

On the other hand, for any network Af, the set of edges £i (p) is a cut that separates the set of sources S from p. Thus, 

min-cut(7V, /) < [from (4)] 

lo g|^| R £,(p),f 

[from I £ . (p) = S and Definition 1.6]. (20) 



log|^||/(^ s )| 
Combining (19) and (20) completes the proof. 

Theorem 4.3. If Af is a network with alphabet A = {0, 1, . . . , q — 1} and symmetric target function f, then 

C md {Af, f) > ^\ Rf p( : • min-cut(AA, /) 

(q - 1) • log g P(s) 

where P(s) is the smallest prime number greater than s and 12 

Rr= min Ri t . 

1 /C{l,...,s} 

Proof. The result follows immediately from Theorem 3.7 and since for any network Af and any target function /, 
min-cut(7V, f) < — min ICI [from (4) and the definition of R A. 

V \og q R f CGA(A^)' ' /J 



The following results provide bounds on the gap between the computing capacity and the min-cut for A-exponential 
and A-bounded functions (see Definition 1.11). 



12 From our assumption, Rj > 2 for any target function /. 
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Theorem 4.4. If X £ (0,1] and Af is a network with a X-exponential target function f, then 

C cod (Af, f) > A ■ min-cut(A/\ /). 

Proof. We have 

ICI 

min-cut(A/\ f ) = min 

ceA(Af) log lA{ R C j 

< min — r [from f being A-exponentiall 

~ CeA(AT) A|I C | 

= — • min-cut(A/") [from (3)]. 

A 

Therefore, 

min-cut(A/", /) 1 min-cut(A/") 

C cod (AA,/) "A' C cod (AT,/) 
1 

< - 
" A 

where the last inequality follows because a computing rate of min-cut(TV) is achievable for the identity target func- 
tion from Theorem 3.1, and the computing capacity for any target function / is lower bounded by the computing 
capacity for the identity target function (since any target function can be computed from the identity function), i.e., 
(Af,f) > min-cut(TV). ■ 

Theorem 4.5. Let A > 0. IfAf is a network with alphabet A and a X-bounded target function f, and all non-receiver 
nodes in the network Af are sources, then 

log M Rt 

CcodW, f) > ^' 1 ■ min-cut(AA, /) 

where 

Rt = min Rt r. 

1 JC{l,...,s} ' J 

Proof. For any network Af such that all non-receiver nodes are sources, it follows from Edmond's Theorem [45, p.405, 
Theorem 8.4.20] that 

II(A0 = min |C| . 

ceA(A0 

Then, 



C C od(-A/", /) > min ICI- min [from Theorem 3.51 

CeA(AT) ceA(AT) log i ^ i R C j 

\C\ 

> min — — [from f being A-boundedl . (21) 

~ ceA(A^) A 

On the other hand, 

ICI 



min-cut(A/", /) = min 



ceA(A0 log|^| R C j 

mm — 

CeA(Af) log|^| R f 



\C\ 

< min - — [from the definition of Rf]. (22) 
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Combining (21) and (22) gives 



min-cutCAf, /) . \C\ 1 

< mm 



CcodCAf,/) CGA(A^) log,,, R f min ±A 

w 1 CeA(AT) A 



A 



lo g|^| R f 



Since the maximum and minimum functions are 1-bounded, and Rf = \A\ for each, we get the following corollary. 

Corollary 4.6. Let A be any ordered alphabet and let Af be any network such that all non-receiver nodes in the 
network are sources. If the target function f is either the maximum or the minimum function, then 

Ccod{N,f) = min-cut(AA,/). 



Theorems 4.2 - 4.5 provide bounds on the tightness of the min-cut(AA, /) upper bound for different classes of 
target functions. In particular, we show that for A-exponential (respectively, A-bounded) target functions, the com- 
puting capacity C co d(Af, f) is at least a constant fraction of the min-cut(A/", /) for any constant A and any network 
Af (respectively, any network AT where all non-receiver nodes are sources). The following theorem shows by means 
of an example target function / and a network Af, that the min-cut(A/\ /) upper bound cannot always approximate 
the computing capacity C co d(Af, /) up to a constant fraction. Similar results are known in network coding as well as 
in multicommodity flow. It was shown in [33] that when s source nodes communicate independently with the same 
number of receiver nodes, there exist networks whose maximum multicommodity flow is 0(1/ logs) times a well 
known cut-based upper bound. It was shown in [23] that with network coding there exist networks whose maximum 
throughput is 0(1/ logs) times the best known cut bound (i.e. meagerness). Whereas these results do not hold for 
single-receiver networks (by Theorem 3.1), the following similar bound holds for network computing in single-receiver 
networks. The proof of Theorem 4.7 uses Lemma 7.1 which is presented in the Appendix. 

Theorem 4.7. For any e > 0, there exist networks Af such that for the arithmetic sum target function f, 



1 

(logs) 



Ccod(Af, f) = 0( ) • min-cut(AA, /). 



Proof. Note that for the network N m .l and the arithmetic sum target function /, 

C 

min-cut^Mx, /) = min [from (5)]. 

Ceh{M M ,L) log 2 (\Ic\ + 1) 

Let m be the number of sources disconnected from the receiver p by a cut C in the network Af M . L . For each such 
source a, the cut C must contain the edge (a, p) as well as either the L parallel edges (er, ero) or the L parallel edges 
(a ,p). Thus, 

min-cut(A/; /x , /) = min ( L + ™ \ . (23) 

l<m<M L log 2 (m + 1) J 
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Let m* attain the minimum in (23) and define c*= min-cut(A( M ,£, /). Then, 

m + 1 



c* / In 2 > min 



i<m<M [ ln(m + 1) 
L = c* log 2 (m* + 1) — m 



— \ > min(-^-l 
1) x>2 llnx) 



> mm 



x>2 I X — 1 



> 1 



[from (23)] 



(24) 



where (24) follows since the function c* log 2 (x + l) — x attains its maximum value over (0, oo) at x — (c* / In 2) — 1. 
Let us choose L = [(log A/) 1 ^ 6 / 2 )] . We have 



L = 0(min-cut(7V A/x , /) log 2 (nun-cut(A/ M , i! /))) 
min-cut(AA MX , /) = fi((log M) 1 -") 

e m d(^M,L,f) = o(i) 

i 



o 



(logAf)i- 



-cut(Af M . L J) 



[from (24)] (25) 
[from (25)] (26) 
[from Lemma 7.1] 

[from (26)]. 



5 An example network 




Figure 4: Network TV has three binary sources, ax, <J2, and 03 with A = {0, 1} and the receiver p computes the 
arithmetic sum of the source messages. 

In this section, we evaluate the computing capacity for an example network and a target function (which is divisible 
and symmetric) and show that the min-cut bound is not tight. In addition, the example demonstrates that the lower 
bounds discussed in Section 3 are not always tight and illustrates the combinatorial nature of the computing problem. 

Theorem 5.1. The computing capacity of network M with respect to the arithmetic sum target function f is 



C 



cod 



log 2 3 ' 



Proof. For any (k, n) solution for computing /, let w^ 1 , w^ 1 , £ {0, l} fe denote the message vectors generated 
by sources <7\, u-x, cr 3 , respectively, and let zi,z 2 G {0, 1}™ be the vectors carried by edges (ai,p) and (02, p), 
respectively. 
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Consider any positive integers k, n such that k is even and 

k 2 

- < T— 1 n- (27) 

n 1 + log 2 3 

Then we have 

2™ > 3 fc / 2 2 fe / 2 . (28) 
We will describe a (k, n) network code for computing / in the network Jv. Define vectors y^> , y^ 2 > G {0, l} k by: 

(1) _ ( wP + toj 3) if 1 < i < k/2 
Vl ~\ wP if fc/2 < z < 

(2) _ f wf ] if 1 < i < fc/2 
Vl ~ \ wf ] + wf ] if k/2 < i < k. 

The first k/2 components of j/ 1 -* can take on the values 0, 1, 2, and the last k/2 components can take on the values 
0, 1, so there are a total of 3 fc / 2 2 fc / 2 possible values for y^\ and similarly for y( 2 h From (28), there exists a mapping 
that assigns unique values to z\ for each different possible value of yW, and similarly for z 2 and y( 2 K This induces a 
solution for TV" as summarized below. 

The source 03 sends its full message vector w^ 3 ' (k < n) to each of the two nodes it is connected to. Source 
<7i (respectively, a 2) computes the vector j/ 1 ) (respectively, j/ 2 - 1 ), then computes the vector z\ (respectively, z 2 ), and 
finally sends z\ (respectively, z 2 ) on its out-edge. The receiver p determines j/ 1 ) and y&l from z\ and z 2 , respectively, 
and then computes j/ 1 ) + y^ 2 \ whose i-th component is w\ + + w[ 3 \ i.e., the arithmetic sum target function 
/. The above solution achieves a computing rate of k/n. From (27), it follows that 

^^ITlbr (29) 



We now prove a matching upper bound on the computing capacity C co d U\, fj . Consider any {k, n) solution for 
computing the arithmetic sum target function / in network TV. For any p £ {0, 1, 2, 3} fc , let 

A p = {( Zl ,z 2 ) : v>M + w (2) + w (3) = p}. 

That is, each element of A p is a possible pair of input edge-vectors to the receiver when the target function value 
equals p. 

Let j denote the number of components of p that are either or 3. Without loss of generality, suppose the first j 
components of p belong to {0, 3} and define € {0, l} fe by 

( 3 ) r if Pl g {0, 1} 

1 ifpiG{2,3}. 



Let 

and notice that 



T = w {2) ) 6 {0, l} k x {0, l} k : w {1) + w {2) + w {3} = p} 

{(z u z 2 ) : (w (1) ,w (2) ) eT,w^ =w^} CAp. (30) 



If + w (2) + u> (3) = p, then: 

(i) pi — = implies wp = wf^ = 0; 

(ii) pi — = 2 implies wp = u^ 2 ' = 1; 

(iii) pi — u>P = 1 implies (w^ , twj ) = (0, 1) or (1, 0). 
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Thus, the elements of T consist of /c-bit vector pairs (u/ 1 ), u/ 2 ') whose first j components are fixed and equal (i.e., 
both are when pi = and both are 1 when pi = 3), and whose remaining k — j components can each be chosen 
from two possibilities (i.e., either (0, 1) or (1, 0), whenpi € {1, 2}). This observation implies that 



\T\ = 2 k - j . 



(31) 



Notice that if only u;W changes, then the sum u>W + + changes, and so Z\ must change (since z 2 is not 
a function of u/ 1 ') in order for the receiver to compute the target function. Thus, if vP~> changes and u/ 3 ' does not 
change, then z\ must still change, regardless of whether changes or not. More generally, if the pair (w^ 1 ' , w^ 1 ) 
changes, then the pair (z\, Z2) must change. Thus, 



and therefore 



[( Zl ,z 2 ):{w^,W^)&T,w^ =w^} 

\A P \ > \{{zi,z 2 ) ■ (w (1 \w (2) ) £T,u/ 3 ' 

>\T\ 
= 2 k ~ j . 

We have the following inequalities: 

4™ > \{(zi,z 2 ) : w {1 \w {2 \w {3) € {0,l} fc } 

= E 

pS{0,l:2,3} fc 

= E E w 

J=° pe{0,l,2,3} fc 



> \T\ 



[from (30)] 
[from (32)] 
[from (31)] 



> 



\{i: Pi G{0,3}}\=j 



E E ^ ] 

J=0 pG{0,l-2,3} fc 

\{i: Pi €{0,3}}\=j 

j=0 V7/ 
6 fc 



[from (33)] 



(32) 



(33) 



(34) 



(35) 



where (34) follows since the A p 's must be disjoint in order for the receiver to compute the target function. Taking 
logarithms of both sides of (35), gives 

k 2 



- < 



n 1 + log 2 3 



which holds for all k and n, and therefore 



1 + log 2 3 



(36) 



Combining (29) and (36) concludes the proof. 
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Corollary 5.2. For the network M with the arithmetic sum target function f, 




Proof. Consider the network M depicted in Figure 4 with the arithmetic sum target function /. It can be shown that 
the footprint size Rc.f = \Ic\ + 1 f° r an y cut C, and thus 

min-cut/TV", /) = 1 [from (5)]. 



The result then follows immediately from Theorem 5.1. 



Remark 5.3. In light of Theorem 5 . 1 , we compare the various lower bounds on the computing capacity of the network 
J\f derived in Section 3 with the exact computing capacity. It can be shown that II ^A/^j = 1. If / is the arithmetic sum 
target function, then 



Thus, this example demonstrates that the lower bounds obtained in Section 3 are not always tight and illustrates the 
combinatorial nature of the problem. 

6 Conclusions 

We examined the problem of network computing. The network coding problem is a special case when the function 
to be computed is the identity. We have focused on the case when a single receiver node computes a function of the 
source messages and have shown that while for the identity function the min-cut bound is known to be tight for all 
networks, a much richer set of cases arises when computing arbitrary functions, as the min-cut bound can range from 
being tight to arbitrarily loose. One key contribution of the paper is to show the theoretical breadth of the considered 
topic, which we hope will lead to further research. This work identifies target functions (most notably, the arithmetic 
sum function) for which the min-cut bound is not always tight (even up to a constant factor) and future work includes 
deriving more sophisticated bounds for these scenarios. Extensions to computing with multiple receiver nodes, each 
computing a (possibly different) function of the source messages, are of interest. 




[from Theorem 3.5] 



[from Theorem 3.7] 



[from Theorem 4.2] . 
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7 Appendix 

Define the function 



q ■■ n* ' 1 ^ — ►{o,i,...,M} fc 



i = l 



as follows. For every a = (a^ , , . . . , a^ 1 ^ ) such that each € {0, 1}' 



Q{a) J = 4 for ever y j e {1, 2, . . . , fc}. 



(37) 



M 



We extend Q for X C JJ{0, l} fe by defining = {Q(a) : a £ X}. 

i=l 

We now present Lemma 7.1. The proof uses Lemma 7.2, which is presented thereafter. We define the following 
function which is used in the next lemma. Let 



for x > 1 (38) 
-cclog 2 x — (1 — x) log 2 (l — x). Note that 



4 

where H" 1 denotes the inverse of the binary entropy function H(x) = 
j(x) is an increasing function of x. 

Lemma 7.1. If lim - — — = 0, then lim C co d{M ML , f) = 1. 

M^roG log 2 M M->oa 

Proof. For any M and L, a solution with computing rate 1 is obtained by having each source Ui send its message 
directly to the receiver on the edge (<7j, p). Hence C co i(Af M , L , /) > 1. Now suppose that Af MiL has a (fc, n) solution 
with computing rate k/n > 1 and for each i g {1,2,..., M}, let 



5l : {0, l} fc — > {0, 1}" 
be the corresponding encoding function on the edge (uj, p). Then for any A\, A%, 



,A M Q {0, l} fc , we have 



M 



• 2 nL > 



M 



(39) 



Each Ai represents a set of possible message vectors of source <Tj. The left-hand side of (39) is the maximum number 
of different possible instantiations of the information carried by the in-edges to the receiver p (i.e., \gi (Ai) \ possible 
vectors on each edge (ai, p) and 2 nL possible vectors on the L parallel edges (orj, p))- The right-hand side of (39) is 
the number of distinct sum vectors that the receiver needs to discriminate, using the information carried by its in-edges. 
For each «e {1,2,..., M}, let z t G {0, 1}™ be such that \g~ x (zi)\ > 2 k ~ n and choose A t = g^ 1 (zi) for each i. 

M 

Also, let = Y[ M- Then we have 

i=i 



0(c/ (M) ) 



< 2 



n L 



[from| ffl (A 2 ) | = land (39)]. 



(40) 



Thus (40) is a necessary condition for the existence of a (k,n) solution for computing / in the network M m ,l- 
Lemma 7.2 shows that 13 

Q(U^ > (M+ l) 7(fc/n)fc (41) 



One can compare this lower bound to the upper bound |Q(C/< A/ >) | < (M + l) k which follows from (37). 
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where the function 7 is defined in (38). Combining (40) and (41), any (k, n) solution for computing / in the network 
Nm.l with rate r = k/n > 1 must satisfy 



r 7 (r) Iog 2 (Af +1) < -log 2 q(u^ 
n V 



< L. 



From (42), we have 



r 7 (r) < 



log 2 (M + l) 



(42) 



(43) 



The quantity r 7 (r) is monotonic increasing from to 00 on the interval [1, 00) and the right hand side of (43) goes 
to zero as M —> 00. Thus, the rate r can be forced to be arbitrarily close to 1 by making M sufficiently large, i.e. 
C co d(A/'j U x, /) < 1. In summary, 

lim C co &(N M .L,f) = 1. 

M — >-oo 



Lemma 7.2. Let k, n, M be positive integers such that k > n. For each i 6 {1,2,..., M }, let A4 C {0, l} fc fee swc/j 



,1/ 



f/iflf |^| > 2 k - n and let = JJ^. 77ie«, 

i=l 

Q(V (M) ) > (A/ + l) 7(fc/ " )fc . 

Proof. The result follows from Lemmas 7.4 and 7.7. ■ 

The remainder of this Appendix is devoted to the proofs of lemmas used in the proof of Lemma 7.2. Before we 
proceed, we need to define some more notation. For every j G {1,2,..., fc}, define the map 



h& : {0,1,..., M} k — ► {0,1,..., MY 



by 



max {0, pi — 1} if i = j 
^ otherwise. 



(44) 



That is, the map /iw subtracts one from the j-th component of the input vector (as long as the result is non-negative) 
and leaves all the other components the same. For every j S {1,2,..., k}, define the map 



by 



Define 



by 



Note that 



Cj) : x {0, l} fe — > {0, l} k 



^(A,a) = \ hii) ^ V AC{0,lfand ae{ 0,lf. 
a otherwise 



0(3) . 2 {04} fc _^ 2 {0,1}" 

'(A) = {^')(i,a) : a G A} 
= |A| . 



(45) 



(46) 
(47) 
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A set A is said to be invariant under the map <jyj) if the set is unchanged when <jP' is applied to it, in which case from 
(45) and (46) we would have that for each a E A, 

!i (il (fl)£i (48) 

Lemma 7.3. For any A C {0, l} k and all integers m and t such that 1 < m < t < k, the set f^* -1 ) (• • • <fp-> (^4))) 
is invariant under the map (j)^ m \ 

Proof. For any A' C {0, l} fc , we have 

0W(^W(A')) = (i) (i4') ViG {1,2,...,*}. (49) 

The proof of the lemma is by induction on t. For the base case t = 1, the proof is clear since ^'(^'(A)) = 
c/)^(A) from (49). Now suppose the lemma is true for all t < r (where r > 2). Now suppose t = r. Let 
B = ^ T ~ 1 \(j)^- 2 \---^ 1 \A))). Since 0< T )(>M(B)) = ^ T \B) from (49), the lemma is true when m = t = r. 
In the following arguments, we take m < r. From the induction hypothesis, B is invariant under the map <ft( m \ i.e., 

cj)( m )(B)=B. (50) 

Consider any vector c € (j)^ (B). From (48), we need to show that (c) € <^ T ' (B). We have the following cases. 

c r = 1 : c, /i (r) (c) G B [from c T = landcG 4> (t) {B)\ (51) 

/i (m) (c) £ B [from (50) and (51)] (52) 

h (r) (h( m \c)) = M m) (/i (T) (c)) G B [from (50) and (51)] (53) 

h {m) (c) G (r) (B) [from (52) and (53)] 

c r = : 3 6 G £ such that {b) = c [from c r = and c G (r) (B)] (54) 

h {m) (6) G B [from (50) and (54)] (55) 

h {m) (h^(bj\ = /i (T) (/i (m) (6)) G (T) (B) [from (55)] (56) 

/i (m) (c) G (t) (B) [from (54) and (56)] . 

Thus, the lemma is true for t = r and the induction argument is complete. ■ 

M 

Let Ai, A 2 , . . . , A M C {0, l} k be such that \Ai\ > 2 k ~ n for each i. Let U {M) = A and extend the definition 

i=l 

of 4>^ in (46) to products by 

M 

^(U^) = H^(A i ). 
i=i 

jj(m) j s to ^ e invariant under <j>W if 

^)([/(M) ) = [; (M)_ 

It can be verifed that £/( M ) is invariant under iff each A4 is invariant under For each i G {1, 2, . . . , M}, let 

Bi^w^-^G-.^A))) 

and from (47) note that 

\B l \ = \A l \>2 k ~ n . (57) 
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Let 



M 



and recall the definition of the function Q (37). 
Lemma 7.4. 



Proof. We begin by showing that 

For every p 6 {0, 1, ... , M} fc ~\ let 

<f{p) 
<Pi(p) 

and note that 



q(V (M) )| > |q(V (m) ) 



q(u {m) )\ > |q(^ (1) (c/ (m) ) 



U <p<p) 

p€{a,i,...,M} k - 1 

U pi(p) 

pe{o,i,...,A/} fc - 1 



>(^)(C/( M ))) = 



where the two unions are in fact disjoint unions. We show that for every p £ {0, 1, . . . , M} k 1 , 

\<p(p)\ > \<fi(p)\ 

which by (59) and (60) implies (58). 

If |y>i(p)| = 0, then (61) is trivial. Now consider any p £ {0, 1, . . . , M} k ~ 1 such that |<£>i(p)| > 1 and let 



K p = max{z : (i,pi, ■ ■ ■ ,Pk-i) € flip)} ■ 



Then we have 



\<Pi(p)\<K p + l. 

Since (K p ,pi, ■ ■ ■ ,Pk-i) € pi(p), there exists (a (1) , a (2) , . . . , a (M) ) G C/ (M) such that 

M 



Then from the definition of the map (/>W in (45), there are if p of the a^'s from amongst 
{aW,o( 2 ) ) .., ) flW} such that 4° = 1 and c/>W (i„aW) = fl W. Let / = {i X) i 2 , . . . , i Kp } C 
the index set for these vectors and let fiW = (a ^ ) for each i £ I. Then for each i £ /, we have 



(58) 



(59) 
(60) 

(61) 



(62) 



(63) 



£ Ai 



Let 



a« = (l,a« . ..,««) 
fi« = (o, . . . , ajj?) £ A t [from c^ 1 ) (A,,a^) = a« and (45)]. 

D S^M & (l) e{a W ,a (l) } forie/, 1_ . . 



(64) 
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From (63) and (64), for every r G R we have 



and thus 
Hence, we have 



n g {0,1,..., |/|}, 

n =pi V i g {2, 3, . . . , k} 

\R\ = \I\ + 1 = K p + 1. 



I^(P)I > 1^1 



A p + 1 



> \Mp)\ 

and then from (59) and (60), it follows that 



[from (64)] 
[from (65)] 
[from (62)] 



(65) 



(V (M) )| > |Q(0 (1) (f/ (M) ) 

For any A C {0, l} fe and any j G {1, 2, . . . , fc}, we know that | </>( J ) (.A) | C {0, l} fe . Thus, the same arguments as 
above can be repeated to show that 

q(^([/( m )))|>|q(^ 2 )(^)([/( m )))) 

> g(^ 3) (^ (2) (0 (1) (^ (M) ))) 



> 



q(V (m) ) 



For any s,r G Z fc , we say that s < r if s; < r; for every £ G {1, 2, . . . , k}. 
Lemma 7.5. Let p G Q(V( M )). If q G {0, 1, ... , A/} fe anc/q < p, then q G Q(V( A/ )). 

Proof. Since g < p, it can be obtained by iteratively subtracting 1 from the components of p, i.e., there exist t > 
and ii, ■ ■ . , it G {1, 2, . . . , k} such that 



Consider any i G {1,2,. ..,&}. We show that foW (p) g Q(l/ (M) ), which implies by induction that q G <2(V (M) ). If 
Pi = 0, then (p) = p and we are done. Suppose that pi > 0. Since p G Q(V^ M ^), there exists &W G £?j for every 
j G {1,2,..., M} such that 

M 

p = Y J b U) 

i=i 

and b\ = 1 for some m G {1,2,..., A/}. From Lemma 7.3, V( M ) is invariant under </>M and thus from (48), 
G B ro and 

m-l A/ 
j=l j=m+l 
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is an element of Q(F (M) ). 

The lemma below is presented in [3] without proof, as the proof is straightforward. 
Lemma 7.6. For all positive integers k, n, M, and 5 G (0, 1), 

k 



min T (1 + mi) > (M + l) Sk . (66) 

< mi < M, 1J- 

S« =1 ™,i > SMk 1=1 



For any a G {0, l} k , let \a\ H denote the Hamming weight of a, i.e., the number of non-zero components of a. The 
next lemma uses the function 7 defined in (38). 

Lemma 7.7. 

<3(V (M) )| > (7\/ + i) 7(fc/,l)fe . 

Proof. Let S = ^(k/n). The number of distinct elements in {0, l} fe with Hamming weight at most \5k\ equals 

L«5fcj 

= 2 (fc -™ )/2 [from (38)]. 

For each i G {1, 2, . . . , M}, > 2 k ~ n from (57) and hence there exists G 5* such that \b^\ H > 5k. Let 

M 



( k . ) < 2 fcw(5) [from [24, p.15, Theorem 1]] 



; = 1 

It follows that pj G {0, 1,2,..., M} for every j G {1,2,..., fc}, and 

k A/ 



i=i i=i 



> SMk. (67) 



The number of vectors g in {0, 1, ... , M} fc such that q di P equals J^J (1 + p^), and from Lemma 7.5, each such 
vector is also in Q(V^ M ^. Therefore, 

o(^)| >na+pj) 



> (M + l) Sk [from (67) and Lemma 7.6]. 



Since 6 = "/(k/n), the result follows. 
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